BiasFinder: Metamorphic Test Generation to Uncover Bias for Sentiment Analysis Systems

نویسندگان

چکیده

Artificial Intelligence (AI) software systems, such as Sentiment Analysis (SA) typically learn from large amounts of data that may reflect human biases. Consequently, the machine learning model in systems exhibit unintended demographic bias based on specific characteristics (e.g., gender, occupation, country-of-origin, etc.). Such biases manifest an SA system when it predicts a different sentiment for similar texts differ only characteristic individuals described. Existing studies revealing rely production sentences small set short, predefined templates. To address this limitation, we present BisaFinder, approach to discover biased predictions via metamorphic testing. A key feature BisaFinder is automatic curation suitable templates pieces text corpus, using various Natural Language Processing (NLP) techniques identify words describe characteristics. Next, instantiates new these by filling placeholders with associated class gender-specific female names, she, her). These are used tease out system. identifies bias-uncovering test case detects exhibits pair texts, i.e., male vs. female) target gender). Our empirical evaluation showed can effectively create number realistic and diverse cases uncover high true positive rate up 95.8\%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Synthetic Text Generation for Sentiment Analysis

Natural language is a common type of input for data processing systems. Therefore, it is often required to have a large testing data set of this type. In this context, the task to automatically generate natural language texts, which maintain the properties of real texts is desirable. However, current synthetic data generators do not capture natural language text data sufficiently. In this paper...

متن کامل

Addressing Age-Related Bias in Sentiment Analysis

Computational approaches to text analysis are useful in understanding aspects of online interaction, such as opinions and subjectivity in text. Yet, recent studies have identified various forms of bias in language-based models, raising concerns about the risk of propagating social biases against certain groups based on sociodemographic factors (e.g., gender, race, geography). In this study, we ...

متن کامل

Statistical Test To Uncover Process Variations

Meeting the tight performance specifications mandated by the customer is critical for contract manufactured ASICs. To address this, at speed test has been employed to detect subtle delay failures in manufacturing. However, the increasing process spread in advanced nanometer ASICs poses considerable challenges to predicting hardware performance from timing models. Performance verification in the...

متن کامل

Test Generation for Duration Systems

We are interested in generating tests for duration systems modeling real-time systems. The specification of a duration system is given as a duration graph. Duration graphs are an extension of timed graphs and are suitable for modeling the accumulated times spent by computations within the considered duration system. In this paper, we present a method for generating tests for duration systems ba...

متن کامل

Uncover: Using Coverability Analysis for Verifying Graph Transformation Systems

UNCOVER is a tool for high level verification of distributed or concurrent systems. It uses graphs and graph transformation rules to model these systems in a natural way. Errors in such a system are modelled by upward-closed sets for which two orders are provided, the subgraph and the minor ordering. We can then exploit the theory of well-structured transition systems to obtain exact or approxi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Software Engineering

سال: 2021

ISSN: ['0098-5589', '1939-3520', '2326-3881']

DOI: https://doi.org/10.1109/tse.2021.3136169